-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Test CheckpointFileTransfer
from recipes PR
#7
base: main
Are you sure you want to change the base?
Conversation
Getting some errors like this:
I think this is due to the fact that I only provide a url, not a |
Not a bad idea. Still, I wonder what's going wrong. Later in the process (in the Perhaps relevant that it opens with |
So weird that this is happening in only some elements! |
Oh shoot! Wrapping the url in |
Note that I did not use cache_target = CacheFSSpecTarget(
fs = gcsfs.GCSFileSystem(),
root_path="gs://leap-scratch/data-library/feedstocks/cache_concurrent"
) |
This finding is super relevant for the upstream PR. I'll see if I can't drum up a test case to reveal the unexpected behavior with |
Ok so I was able to run a complete lowres-mli here, with https-sync patch activated for both the caching and the openwith fsspec but I want the download to be faster. Disabling the https-sync patch and setting concurrency to 20 gives me a bunch of these:
Wondering if this goes away if I reduce the concurrency. |
@moradology should we track this in a separate issue? Just asking since I expect to close this PR soon. |
Issue up here: pangeo-forge/pangeo-forge-recipes#752 |
Yoinks I am all the sudden getting a lot of failed transfers (for the mlo dataset). Not entirely sure if I am getting rate limited because I just downloaded 800GB of data in a short succession, or if one of the many alterations here screwed somethign. Have now submitted a job with reduced concurrency for now, and will wait until tomorrow to continue. |
I just tried to freeze the actual commit hash for the requirements and increase concurrency (all files are cached rn). |
Testing pangeo-forge/pangeo-forge-recipes#750.
I did the following here:
Todo:
leap-scratch/data-library/feedstocks/cache_concurrent/000b7ecb864a18a4a2b56492d8cf35d4-https_huggingface.co_datasets_leap_climsim_low-res_resolve_main_train_0001-05_e3sm-mmf.mli.0001-05-11-68400.nc
) - Confirmed here